EReXS: Event and Relations Extraction for SWHi

نویسندگان

  • Proscovia Olango
  • Henk Ellermann
  • Hamish Cunningham
  • Valentin Tablan
  • Diana Maynard
  • Kalina Bontcheva
  • Marin Dimitrov
چکیده

”Automatic event extraction from fulltext resources is a combination of human language technology (HLT) and semantic web technologies. It can also be done on the base of purely statistical means with minimal linguistic knowledge”. This thesis introduces a semi-automated method based on the HLT approach. The method uses an existing information extraction system called ANNIE, A Nearly-New Information Extraction System (developed by Hamish Cunningham, Valentin Tablan, Diana Maynard, Kalina Bontcheva, Marin Dimitrov and others). Further text analysis is supported by WordNet and parsers that help in the automatic extraction of historical events and their relations to objects of the human society. Although the method is developed for fulltext resources in the field of history, it is anticipated that it shall also be applied to e-resources in other fields for automatic extraction of historical events. The subject of history is well reckoned with its chronological record of true events, leading from the past to the present and even into the future. When used as the name of a field of study, history refers to the study and interpretation of the record of human societies. Historical events extraction, therefore involves the identification of past events and their semantic relations to human society. 1Thierry Declerck, Automatic event extraction from text on the base of linguistic and semantic annotation, DFKI Language Technology Lab. German Research Center for Artificial Intelligence GmbH, 2005 2http://en.wikipedia.org/wiki/History

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Integration of Static Relations to Enhance Event Extraction from Text

As research on biomedical text mining is shifting focus from simple binary relations to more expressive event representations, extraction performance drops due to the increase in complexity. Recently introduced data sets specifically targeting static relations between named entities and domain terms have been suggested to enable a better representation of the biological processes underlying ann...

متن کامل

PubMed-Scale Event Extraction for Post-Translational Modifications, Epigenetics and Protein Structural Relations

Recent efforts in biomolecular event extraction have mainly focused on core event types involving genes and proteins, such as gene expression, protein-protein interactions, and protein catabolism. The BioNLP’11 Shared Task extended the event extraction approach to sub-protein events and relations in the Epigenetics and Post-translational Modifications (EPI) and Protein Relations (REL) tasks. In...

متن کامل

Annotating Inter-Sentence Temporal Relations in Clinical Notes

Owing in part to the surge of interest in temporal relation extraction, a number of datasets manually annotated with temporal relations between event-event pairs and event-time pairs have been produced recently. However, it is not uncommon to find missing annotations in these manually annotated datasets. Many researchers attributed this problem to ”annotator fatigue”. While some of these missin...

متن کامل

A New Method for Improving Computational Cost of Open Information Extraction Systems Using Log-Linear Model

Information extraction (IE) is a process of automatically providing a structured representation from an unstructured or semi-structured text. It is a long-standing challenge in natural language processing (NLP) which has been intensified by the increased volume of information and heterogeneity, and non-structured form of it. One of the core information extraction tasks is relation extraction wh...

متن کامل

Detecting Entity Relations as a Supporting Task for Bio-Molecular Event Extraction

Recently, the focus in the BioNLP domain has shifted from binary relations to more expressive event representations, largely owing to the international popularity of the BioNLP Shared Task (ST) of 2009. This year, the ST’11 provides a further generalization on three key aspects: text type, subject domain, and targeted event types. One of the supporting tasks established to provide more finegrai...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006